Towards Multi-tenant GPGPU: Event-driven Programming Model for System-wide Scheduling on Shared GPUs

نویسندگان

  • Yusuke Suzuki
  • Hiroshi Yamada
  • Shinpei Kato
  • Kenji Kono
چکیده

Graphics processing units (GPUs) are attractive to the generalpurpose computing (GPGPU) beyond the graphics purpose. Sharing GPUs among such GPGPU applications is a key requirement especially for cloud platforms whose resources are utilized by various cloud users. However, consolidating recent GPU applications, referred to as GPU eaters, on a GPU poses a new challenge. Such advanced applications are designed based on the assumption that only one GPU application runs on a GPU at a time. In this paper, we present GLoop, a GPGPU framework that allows multiple GPU eaters to share a GPU. GLoop offers an event-driven programming model that generates scheduling points in GPU kernels. It achieves resource isolation among GPU applications and schedules applications by suspending/resuming GPU kernels only if necessary to reduce the number of kernel launches. The preliminary experiments demonstrate that GLoop model is applicable to a GPU eater. The results also show that GLoop enables two GPGPU applications to run concurrently on a shared GPU by interleaving GPU kernel execution.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

GPU-SAM: Leveraging multi-GPU split-and-merge execution for system-wide real-time support

Multi-GPUs appear as an attractive platform to speed up data-parallel GPGPU computation. The idea of split-and-merge execution has been introduced to accelerate the parallelism of multiple GPUs even further. However, it has not been explored before how to exploit such an idea for real-time multi-GPU systems properly. This paper presents an open-source real-time multi-GPU scheduling framework, c...

متن کامل

DSPE: DOMAIN-SPECIFIC LANGUAGE FOR PARALLEL REAL-TIME STREAM PROCESSING A Model-Driven Generative-Programming Approach for Event-Driven Scheduling and Dynamic Load Balancing on Multi-Core Processors and Graphical Processing Units (GPUs)

types are also available for the event-driven infrastructure and the parallel processing support. More information is provided in Section 5.2 and Section 6.1. Listing 3.2: C structs of DSPEElement and DSPEComponent. 1 struct DSPEElement { 2 DSPEElement *container; 3 DSPERunner *runner; 4 DSPEOwner *owner; 5 char* (*getID) (const DSPEElement *element); 6 }; 7 8 struct DSPEComponent { 9 DSPEEleme...

متن کامل

GPU-Chariot: A Programming Framework for Stream Applications Running on Multi-GPU Systems

This paper presents a stream programming framework, named GPU-chariot, for accelerating stream applications running on graphics processing units (GPUs). The main contribution of our framework is that it realizes efficient software pipelines on multi-GPU systems by enabling out-of-order execution of CPU functions, kernels, and data transfers. To achieve this out-of-order execution, we apply a ru...

متن کامل

A multi-product vehicle routing scheduling model with time window constraints for cross docking system under uncertainty: A fuzzy possibilistic-stochastic programming

Mathematical modeling of supply chain operations has proven to be one of the most complex tasks in the field of operations management and operations research. Despite the abundance of several modeling proposals in the literature; for vast majority of them, no effective universal application is conceived. This issue renders the proposed mathematical models inapplicable due largely to the fact th...

متن کامل

Vasculature segmentation using parallel multi-hypothesis template tracking on heterogeneous platforms

We present a parallel multi-hypothesis template tracking algorithm on heterogeneous platforms using a layered dispatch programming model. The contributions of this work are: an architecture-specific optimised solution for vasculature structure enhancement, an approach to segment the vascular lumen network from volumetric CTA images and a layered dispatch programming model to free the developers...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2016